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We consider the empirical process Gt of a one-dimensional diffu- 
sion with finite speed measure, indexed by a collection of functions 
jr. By the central limit theorem for diffusions, the finite-dimensional 
distributions of Gt converge weakly to those of a zero-mean Gaussian 
random process G. We prove that the weak convergence Gt G takes 
place in if and only if the limit G exists as a tight, Borel mea- 

surable map. The proof relies on majorizing measure techniques for 
continuous martingales. Applications include the weak convergence 
of the local time density estimator and the empirical distribution 
function on the full state space. 

1. Introduction and main results. Let X be a diffusion process on an 
open interval / = (/,?") C R, that is, a strong Markov process with contin- 
uous sample paths, taking values in /. Denote the corresponding laws by 
{P^ :a; G /} so that Xq = x under P^. Assume as usual that X is regular 
on I, meaning that for all x,y & I it holds that PxiTy < cxd) > 0, where 
Ty = inf{t ■.Xt = y}. Under this condition, the scale function s and the speed 
measure m of the diffusion X are well defined. The scale function is a contin- 
uous, strictly increasing function from / onto M, which implies in particular 
that the diffusion is recurrent. The speed measure is a Borel measure that 
gives positive mass to every open interval in I (cf. [9, 11, 24, 25]). 

We will assume throughout that the speed measure m is finite, that is, 
m(/) < cx). We denote the normalized speed measure by fi = m/m{I), and 
the distribution function corresponding to /i by F. The finiteness of m im- 
plies that the process X is in fact positive recurrent, and /x is the unique 
invariant probability measure. Hence, by the ergodic theorem, it a.s. holds 



Received March 2003; revised August 2004. 

AMS 2000 subject classifications. 60J60, 60J55, 60F17, 62M05. 

Key words and phrases. Diffusions, continuous martingales, local time, majorizing mea- 
sures, uniform central limit theorem, Donsker class, local time estimator. 

This is an electronic reprint of the original article published by the 
Institute of Mathematical Statistics in The Annals of Probability, 
2005, Vol. 33, No. 4, 1422-1451. This reprint differs from the original in 
pagination and typographic detail. 



1 



2 



A. VAN DER VAART AND H. VAN ZANTEN 



that 



tJo 

for / S L^ip)- It is well known that under the stated conditions, the diffusion 
also obeys a central limit theorem. It states that for every function / G L^ifJ^) 
we have the weak convergence 



(1.1) 



Vt 



1 /•* 



tJo 



f{Xu)du- / fd^l]^N{Q,T{fJ)) 



as t ^ oo, provided that the asymptotic variance 

r(/, /) = 47n(/) {^j^ f{y)Kdy) - F{x) f{y)li{dy)^ ' ds{x) 

is finite (see, e.g., [17]). Using the Cramer- Wold device, it is easy to obtain 
the multidimensional extension of this result. For every finite number of 
functions /i, • • • , /d S L^ifJ-), we have 



Vt 



/I 

tJo 



fi{Xu)du- / fidiJL 



\\ fdi^u) du- J^fddfi J 

r(/i,/i) 




r(/d,/i 



r(/i,/d)\\ 



rUd,fd)JJ 



where the asymptotic covariances T{fi,fj) are defined by 



r(/,<7)=4m(I) 



(1.2) 



f{y)fi{dy)-F{x) / f{y)Kdy)) 



l{y)fJ-idy) - F{x) / g{y)n{dy) ] ds{x) 



and the variances Yi^fi^fi) are assumed to be finite. 

In this paper we investigate the infinite-dimensional extension of the cen- 
tral limit theorem for diffusions. We let the function / in (1.1) vary in an 
infinite class of functions and derive necessary and sufficient conditions 
under which the weak convergence takes place uniformly on T. More pre- 
cisely, let J- C L^{^) be a class of functions and define for each t > the 
random map on T by 



(1.3) 



Gtf = VtQj^f{X^)du-pdfj}j. 
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The map Gt is called the empirical process indexed by J^. If sup^gjp / |/| c?^ < oo, 
the random map Gt is a (not necessarily measurable) random map in the 
space 1°°{J-) of uniformly bounded functions zi^F— > M, equipped with the 
uniform norm ||2:||oo = supj^j^ \^{f)\ [see (1.6)]. We say that the class is a 
Donsker class if the random maps Gt converge weakly in to a tight, 

Borel measurable random element G of i°°{J-). 

Since weak convergence in i°^{J^) to a tight Borel measurable limit is 
equivalent to finite-dimensional convergence and asymptotic tightness (see 
[1, 6], or, e.g., [29], Theorem 1.5.4), the multidimensional central limit theo- 
rem implies that the limit G must be a zero-mean, Gaussian random process 
indexed by with covariance function FiGfGg = T(f,g). Hence J- can be 
Donsker only if there exists a version of the Gaussian process G that is a 
tight Borel measurable map into . By general results on Gaussian pro- 

cesses this is equivalent to existence of a version of G whose sample paths are 
uniformly bounded and uniformly continuous on relative to the natural 
pseudo-metric that G induces on given by 

dlif,g)=E{Gf-Ggf. 

(In other words, the class .7-" is a GC-set in the appropriate Hilbert space, 
in the sense of Dudley [5] or [6]. Also cf. [8], or Example 1.5.10 of [29].) 
Surprisingly, the existence of the limit process is also sufficient for to 
be Donsker. In contrast with the situation for i.i.d. random elements no 
additional (entropy) conditions that limit the size of the class are required. 

It also turns out that the processes Gt themselves possess bounded and 
dc-continuous sample paths as well, whence the weak convergence actually 
takes place in the space Cf,{J^,diQ) of bounded, dc-eontinuous functions on 
T (cf. Theorem 1.3.10 in [29]). 

Theorem 1.1. Suppose that T is bounded in L^{fj,). Then T is Donsker 
if and only if the centered, Gaussian random map G on J- with covariance 
function FiGfGg given by (1.2) admits a bounded and d^-uniformly contin- 
uous version. In that case, for every x £ I, 

Gj =^ G inCb{J^,dG) ast^oo. 

In fact, we can prove a more general result. Since X is a regular diffusion, 
it has continuous local time {lt{x):t > 0,a; E /) with respect to the speed 
measure m. For every integrable function / the occupation times formula 
says that 



(1.4) 



^* f{Xu) du = J^ f{x)lt{x)m{dx). 
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This means that we can write the empirical process as 

Gtf = VtJ^fix) Qlt{x) - ^) m{dx). 

There is no special reason to look only at integrals of this specific type. With 
the same effort we can consider general integrals of the form 



m{I) , 

where A is an arbitrary signed measure on /, with finite total variation 
II A II . In this manner, we obtain a uniform central limit theorem for general 
additive functionals. 

So let A be a collection of signed measures on /. We define the random 
maps EI( on A by 



.A = V^/(i/,(x)-^)A(dx). 



Slightly abusing terminology, we call Hj the empirical process indexed by the 
class A. By the multidimensional central limit theorem, the finite-dimensional 
distributions of Hit converge weakly to those of a Gaussian, zero-mean ran- 
dom map EI on A with covariance function 

(1.5) EMAIHIz/ = / {\{l,x\-F{x)\{l)){y{l,x\-F{x)v(I))ds(x). 



m{I) J I 

As before, the Gaussian random map IH induces a natural pseudo-metric 
d|i(A, v) = E(IHIA - on the class A. 

If the total variations of the signed measures are uniformly bounded, that 
is, sup_xg^ ||A|| < oo, then, for every fixed t, 



(1.6) suplMtAI < \/tsup 

AgA x&I 



sup ||A|| < oo a.s. 
aga 



t ' ' m{I) 

Hence is a random map into the space (A) , and we can ask whether the 
weak convergence of to H takes place in ^°°(A), with a tight, Borel mea- 
surable limit process. If this is the case, we call the collection A a Donsker 
class. Again, the existence of the limiting process, which is obviously neces- 
sary, is also sufficient. As before, by general results on Gaussian processes the 
existence can be translated into the existence of a version of the Gaussian 
process H that has bounded and dm-uniformly continuous sample paths. 

Theorem 1.2. Suppose that sup;^^^ ||A|| < oo. Then A is Donsker if and 
only if the centered, Gaussian random map EI on A with covariance function 
(1.5) admits a bounded and d^-uniformly continuous version. In that case, 
for every x £ I, 

P 

EIj EI mCfe(A, (in) as t—>oo. 
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Theorem 1.1 is indeed a special case of Theorem 1.2, since Gj/ = H^A/, 
where \f{dx) = f{x)m{dx). 

The theory of majorizing measures provides necessary and sufficient con- 
ditions for the existence of bounded and de-uniformly continuous Gaussian 
processes on A in terms of the geometry of the pseudo- metric space (A, du)- 
See [7, 27], and Chapters 11 and 12 of [16]. We shall use this theory to prove 
our main theorem. Conversely, we can use it to deduce the following analytic 
characterization of the Donsker property. 

If (y, d) is a pseudo-metric space, we denote by Bd{y,e) the closed ball 
around y of d-radius e. 

Corollary 1.3. Suppose that sup;^^^ ||A|| < oo. Then A is Donsker if 
and only if there exists a Borel probability measure v on (A, liu) such that 



Proof. Combine Theorem 1.2 with Theorems 11.18 and 12.9 of [16]. 

□ 

In general, the majorizing measure condition is less stringent than the 
metric entropy condition introduced by Dudley [5]. However, the latter is 
often easier to work with in concrete cases. Therefore, it is useful to give a 
sufficient entropy condition for A to be Donsker. If (y, d) is a pseudo-metric 
space, we denote by N{£, Y, d) the minimal number of closed balls of d-radius 
£ that is needed to cover Y . 

Corollary 1.4. Suppose that sup;^g^||A|| < oo. Then the class A is 
Donsker if 



In view of definition (1.5) the covering number N{e,A,d^) is the L'^{s)- 
covering number of the class of functions 



These functions are of uniformly bounded variation and hence the full class, 
with the elements of A of uniformly bounded variation, possesses a finite 
L^((5)-entropy integral for any finite measure Q. (See, e.g., [29], Theo- 
rem 2.7.5.) Unfortunately, this observation is useless in the present situation, 
as under our conditions the measure defined by the scale function s is un- 
bounded. Under appropriate bounds on the tails of the envelope function 





X I— > 



\{l,x\-F{x)\{I) 



AG A. 
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of the class, it is still possible to exploit the fact that the functions are of 
bounded variation by a partitioning argument, as in Corollary 2.7.4 of [29]. 
Alternatively, for special A we can use the preceding corollary in combina- 
tion with VC-theory. However, the best results are obtained through direct 
application of Theorem 1.2, as this allows to exploit the fine properties of 
Gaussian processes. We illustrate this in Section 2 by several examples of 
interest. 

The Donsker theorem is based on approximation by a continuous local 
martingale and an analysis of local time. In Section 3 we present a uniform 
central limit theorem for continuous local martingales under a majorizing 
measure condition. This extends a result by Nishiyama [22], and is of inter- 
est on its own. In Section 4 we recall the necessary results on local time. 
Following these preparations the final section gives the proofs of the main 
results. 

Let us remark that because of the use of local time, our approach is 
limited to the one-dimensional case. In higher dimensions one has to resort to 
different methods, using for instance the generator of the diffusion to relate 
the empirical process to a family of local martingales. This approach was 
followed (for general stationary, ergodic Markov processes) by Bhattacharya 
[3] to obtain a functional central limit theorem for 



where / is one fixed function, t > and n — > oo. It is not clear, however, 
whether necessary and sufficient conditions can be obtained in this way. 

The notation a < 6 is used to denote that a < C6 for a constant C that is 
universal, or at least fixed in the proof. 

2. Examples. In this section we consider four special cases of Theo- 
rem 1.2. 

2.1. Diffusion local time. The first example is a uniform central limit 
theorem for diffusion local time. The space of continuous functions on a 
compact set J C M, endowed with the supremum norm, is denoted by C( J). 

Theorem 2.1. Suppose that Jj F'^ {I - F)"^ ds < oo. Then, for all x e I 
and compact J C /, 



in C{J), where G is a zero-mean Gaussian random map with covariance 
function 
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Proof. We apply Corollary 1.4 with A = {6x -x G J}, where 5x is the 
Dirac measure concentrated at x. The integrability of the function — 
F)'^ is equivalent to the finiteness of the covariance function of the limit G. 
To verify the entropy condition, observe that the pseudo-metric d that is 
induced by G on A is given by d{5x,Sy) = \/\s{x) — s{y)\. It follows that the 
space {A,d) is isometric to (s(J), -v/pT)- Since s{J) is compact, this implies 
that the entropy condition of Corollary 1.4 is satisfied. Hence, we have weak 
convergence in £°°{J), and therefore also in C{J), since diffusion local time 
is continuous in the space variable (see Section 4). □ 

We remark that the weak convergence of the normalized local time pro- 
cess, as in the preceding theorem, cannot be extended to uniformity on the 
entire state space /. By the continuous mapping theorem, uniform weak 
convergence in would imply weak convergence of the sup-norm to a 

finite limit. But since the function x i-^ lt{x) vanishes outside the range of 
{Xg : < s < t), which is strictly within / a.s., we have a.s. 



which would lead to a contradiction. 

On the other hand, we can construct a version of the limit process G 
with continuous (but not necessarily bounded) sample paths on the entire 
state space. Then the process ^/t{k/t - l/m{I)) indexed by I converges to 
G relative to the topology of uniform convergence on compacta. [To con- 
struct a version of G with continuous sample paths on I, first construct an 
arbitrary version G indexed by a countable dense subset Q C I. In view of 
the entropy bound obtained in the proof of Theorem 2.1 the modulus of 
continuity supj,^^^ j^g. |G(s) — G(t)| of the restriction of this process 

to a given compact J C I converges to zero in mean as (5 | 0. Thus up to 
a null set the process G is uniformly continuous on bounded subsets of its 
(countable, dense) index set. We can extend it by continuity to the whole 
state space I.] 

For later reference we note that, given the integrability of the function 
F^(l — -F)^, there exist positive constants ci,C2 such that, for all x £ I, 



Because the function s is unbounded, this too shows that there is no version 
of G with bounded sample paths. 




(2.1) 



ci(l + \s{x)\) < EG\x) < C2(l + |s(2;)|). 



2.2. Empirical process indexed by functions. In this section we give a 
sufficient condition for the weak convergence of the empirical process (1.3) 
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indexed by a general class T of functions. This covers many concrete exam- 
ples. However, for special classes of fmictions, such as indicators in the line, 
the result can be improved, as illustrated in the next sections. 

Let (1 + denote the measure with density (1 + vT*I) relative to 

F. 

Theorem 2.2. Suppose that JjF'^{l — F)"^ ds < oo. Then every class of 
functions C + ^/s) dF) that satisfies the entropy condition 



is Donsker. 



roo == 

/ VlogN{e,J^,L^{{l + J\s\)dF))de<oo 

JQ * 



Proof. In view of the occupation times formula Gtf = Vi Jj f{lt/t — 
l/m{I)) dm. Therefore, a version of the limit process HI must be given by 
EI/ = / fGdm, for G the limit process of the diffusion local time process 
obtained in Theorem 2.1. Because + ^/\s\)dm < oo by assumption 

and E|G| < 1 + vT^ t)y (2.1), this process is indeed well defined. It is easily 
shown that this process EI is a mean-zero process with the correct covariance 
structure, whence it suffices to check that it possesses a version with bounded 
and uniformly continuous sample paths. 

Now 

E{Mf-Mg)^ = ^{E\Mf-Mg\f<(^J \f-g\EGdmy. 

In view of (2.1) the intrinsic metric dfi{f, g) is bounded above by a multiple of 
the -^^^((1 + vT^) dm)-iiovm. of f — g. Hence the existence of the appropriate 
version of HI follows from [5] . □ 

Example 2.3. As a particular example, we may take any VC-class J- 
with an envelope function F such that 

IF(x)(l + \/|s(x)| )dm{x) < oo. 

I 

Then the covering number N{eQ¥,J',L\Q)) is bounded by C(l/e)^ for 
y -|- 1 the VC-index of the class J- and C a constant depending on V only, 
and any cj-finite measure Q such that Q¥ < oo. (See, e.g.. Theorem 2.6.7 
in [29], where it is clear from the proof that the result extends to cj-finite 
measures Q.) In particular, the entropy condition of the preceding theorem 
is satisfied, and hence J- is Donsker. 

Example 2.4. Another example is given by the collection of all mono- 
tone functions f :I ^ [0)1]- Because this has a finite entropy integral for any 
finite measure, this class is Donsker if Jj y^^\s\dF < oo. 
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Example 2.5. A third example is given by the collection of all functions 
/ : I ^ [0, 1] with \ f{x) — f{y)\ <\x — y\°' for some a > 1/2 in the case that 
/ is compact. This class has entropy relative to the uniform norm bounded 
above by a multiple of and hence satisfies the entropy condition of 

the preceding theorem if Jj ^y\s\dF < oo. 

Using the approach of Corollary 2.7.4 of [29], this can be extended to 
unbounded state space I = M under the condition that for some p <2/3 

y( [ {1 + VW^ ) dF {x)Y <oo. 

~^i\Jj<\x\<j+i J 

Analogy with the case of empirical processes for independent observations 
suggests that the class will remain Donsker if this holds for p = 2/3, but we 
have not investigated this. 

2.3. Local time density estimator. Suppose that the invariant probability 
measure /x has a locally bounded density / with respect to a measure u on /. 
Then it follows from the occupation times formula (1.4) that the empirical 
measure ^t-, defined by 

lit{B) = \ flB{Xu)du, 
t Jo 

has the (random) density 

j^^^-^ ^ m{I)f{x)lt{x) 

with respect to v. In the statistical literature this density ft is often called 
the local time estimator of /; see, for example, [4]. If v is the Lebesgue 
measure on / and / is continuous, then ft is simply the derivative of the 
empirical distribution function. 

Kutoyants [14] and Negri [19] studied the statistical properties of the lo- 
cal time estimator for regular diffusions on M that are generated by certain 
stochastic differential equations. In particular, for the special class of diffu- 
sions he considered, Kutoyants [14] showed that the normalized difference 
Vt{ft — f) converges weakly to a Gaussian limit, uniformly on the whole 
state space /. In this section we complement and generalize their results, 
giving precise conditions for general regular diffusions. 

The finite-dimensional distributions of Vt{ft — f) converge weakly to 
those of the centered, Gaussian random map H with covariance function 

EH(a;)H(y) = Am{I)f{x)f{y)j{t^^^,) - F)(l[,,,,) - F) ds, 

provided that these covariances are finite. The following theorem gives nec- 
essary and sufficient conditions under which this finite-dimensional conver- 
gence can be extended to uniform convergence, on compacta or on the full 
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state space I. Recall that we assume throughout that / is bounded on com- 
pact subsets of I. 

Theorem 2.6. (i) We have y/i{ft- f) in £°°{J) for every compact 

J and X £ I if and only if Jj — F)"^ ds < oo. 

(ii) We have Vi{ft — f) EI in £°"{I) for every x £ I if and only ifM. 
admits a version such that M{x) — > almost surely as x II or x] r. 

Proof. Because Vi{ft — f) = fVtik/t- l/m{I)), a version of the limit 
process H of Vi{ft — f) can be defined as HI = /G, for G the limit process of 
the local time process appearing in Theorem 2.1. In the following we use a 
version IHI = fG obtained from a version of G with continuous sample paths 
on the entire state space I. 

For any x £ I, 

Elf (x) = Am{I) f'^ (x) ( [ F'^ds+ [ {1 - F)'^ ds\ 

\J{l,x] J{x,r) J 

Therefore, the condition Jj F^{1 — F)'^ < oo is equivalent to the finiteness 
of EEI^(rE) for some x with f{x) > (and then for all x £ I), whence the 
condition is certainly necessary. 

(i) Since / is locally bounded, the map i°°{J) — > i°°{J) defined hy fz 
is continuous for the uniform norm. Because G is a tight Borel measurable 
element in 1°°{J), so is the process H = fG. Thus the assertion follows from 
Theorem 2.1. 

(ii) From (2.1) it follows that ElP{x) = f'^{x)EG^{x) is bounded on / 
only if the function f^s is bounded. Because s{x) — > ±00 as x approaches 
the boundary of /, it follows that in this case f{x) — > at the boundary of 
/. 

Because the sample paths x 1— > lt{x) of local time vanish for x near the 
boundary of the state space I and f{x) ^ as x tends to this boundary, the 
sample paths of the process Vt{ft — f) tend to zero at the endpoints of /. If 
Vi{ft — f) converges to a tight limit IH in then the sample paths of H 

must tend to zero at the boundary points also, as can be seen, for instance, 
from an almost sure construction. Thus the condition in (ii) is necessary. 

To prove sufficiency, it suffices to show that there exists a version of HI 
that is a tight, Borel measurable map into Let Jm be an increasing 

sequence of compact intervals with Jm T 1^ and let Mm = /Glj^ be the 
process indexed by / with sample paths equal to fG on Jm and equal to zero 
outside Jm- Because the restriction of G to Jm is a tight, Borel measurable 
map into C{Jm) C i°°{Jm) and Mm is the image of this restriction under 
the continuous map z 1-^ fzlj^ from i°°{Jm,) to the process Mm is a 

tight, Borel measurable map into £°°{I). The process EI = fG as constructed 



DONSKER THEOREMS FOR DIFFUSIONS 



11 



in the first part of the proof is separable, because it possesses de-uniformly 
continuous sample paths on every (Euclidean) compact interval J C /, which 
is de-totally bounded by tightness of M. This implies that this version of 
the limit process satisfies sup^^j^ PC^^)! ~^ almost surely, as m — > oo, as 
does the version of H in the statement of (ii) . Consequently, 

sup\mm{x)-M{x)\ < sup \m{x)\ ^0, 

almost surely. We conclude that the process EI is the almost sure limit in 
of a sequence of tight, Borel measurable maps into £°°(I). This implies 
that H is itself also a tight, Borel measurable map into £°°{I), in view of the 
lemma below. □ 

The following lemma gives an easily verifiable sufficient condition for the 
convergence of Viift — f) on the entire state space, which is necessary under 
a mild regularity condition. 

Corollary 2.7. Suppose that JjF'^{l — F)^ ds < oo. Then if the func- 
tion f'^(x)\s{x)\loglog\s{x)\ ^ asx^l orx^r, the convergence 

Viift-f)^M 

takes place in £^{I) for every x £ I. If the function f'^s is monotone near 
I and r, then these conditions are also necessary. 

Proof. We first prove sufficiency. Let G be the limit process in The- 
orem 2.1, H = fG and let W be a two-sided Brownian motion, emanating 
from zero. By the preceding theorem, it suffices to show that the sample 
paths of EI converge to zero at the boundary points of /. Observe that 
E((G(x) - G(y))2 = (4/m(/))E(W(s(x)) - W{s{y))f for all x,yel. More- 
over, by (2.1), we have EG^(x) < EW^(s(x)) for x such that \s{x)\ is bounded 
away from 0. It follows that for y >x £ I close enough to r, 

E{m{x)-m{y)f 

= Eifix)Gix)-fiy)Giy)f 

< ifix) - f iy)f EG\x) + f\y)EiG{x) - Giy)f 

< (fix) - f{y)rfEW\s{x)) + f\y)E{W{s{x)) - W{s{y))f 
= (fix) - f{y)fEW\six)) + /2(y)E(W(s(x)) - W(s(y)))=^ 

- 2/(y)(/(x) - f{y))EW{s{x))W{{s{y)) - W{s{x))) 
= E{f{x)W{s{x)) - fiy)W{s{y))f, 
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by the independence of the Brownian increments. If we define M{r) = and 
/(r)W(s(r)) = 0, then the processes H and /Wos are continuous in at r, 
as s(x)/^(x) — > as X t r by assumption. Consequently, under this extension 
of the index set the inequahty in the display remains valid for x,y £ [xo,r], 
for sufficiently large xq. It follows that 

E sup |E[(x)| <E sup {M{x)-M{y)) 

x>xo xo<x,y<r 

<2E sup M{x)<2E sup (/(x)W(s(x))), 

xo<x<r xo<x<r 

by Slepian's lemma. By the law of the iterated logarithm for Brownian mo- 
tion and the condition on /, we have that /(x)W(s(x)) almost surely as 
X t r. Therefore, the median of the variables sup^^^^^^. |/(x)W(s(x))| con- 
verges to zero as xq | r. As a consequence of Borell's inequality the mean 
of a supremum of a separable Gaussian process is bounded above by a mul- 
tiple of the median and hence the right-hand side of the preceding display 
converges to zero. We conclude that the sequence sup^^^^ converges 
to zero in mean, and by monotonicity also almost surely, for x — > r. Similar 
reasoning applies for x — > L 

Now assume that /2(x)s(x) i as X I r. Then / is also decreasing near r. 
Furthermore, for y> x, 

E{G{y) - G(x))G(x) = --^ J l[x,y)il[x,r) -F)ds< 0. 

We conclude that, for y >x sufficiently close to r, 

E(H(2/) - M(x))' = E(/(x)G(x) - fiy)G{y)f 

> fiyfEiGiy) - G{x)f + (/(y) - fix)fEG\x) 

>f{y?{s{y)-s{x)). 

Let Xn be such that s(x„) = e". Then s(x„) — s{xm) > s{xn){l — e~^) for 
n> m, whence for sufficiently large m and n> m, 

E(H(X„) - m{Xm)f > f{Xn)s{Xn) =■ a^, 

and hence dm{xk-,xi) > a2n for alln< k,l < 2n. So the points x^, x„+i, . . . , X2n 
are a2n-separated, and Sudakov's inequality implies that 

E sup |IHI(xfc)| >a2nVlogn>a2nVlog2n. 

n<k<2n 

If M(x) almost surely as x t r, then the left-hand side tends to zero, and 
we conclude that logn — > as n — > oo. Together with the monotonicity of 
/^s this implies the necessity of the right tail condition. The condition on 
the left tail can be seen to be necessary in the same way. □ 
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Lemma 2.8. Let Xn,X -.0, ^ T> be maps from a complete probability 
space (fi,^, P) into a complete metric space T>. If Xn is Borel measurable 
and tight for every n, and d{Xn, X) ^ in outer probability, then X is Borel 
measurable and tight. 

Proof. The map X is Borel measurable, because the convergence in 
outer probability implies the existence of a subsequence that converges al- 
most surely. The pointwise limit of a sequence of Borel measurable maps 
into a metric space is itself Borel measurable. 

If F* {d{Xn, X) > (5) — > for every 6 > 0, then there exists a sequence 
5n 10 such that F*{d{Xn, X) > 6n) — > 0. Hence given some e > we can find 
a subsequence ni < n2 < • • • such that F* {d^Xn^ , X) > Snj) < £2~^ for every 
j S N. By the tightness of X„ for a fixed n, we can find a compact set Kn 
with P(X„ <e2-". 

The set C = f]jKn"/ , where = {x:d{x,K) <6}, is totally bounded. 
If this were not the case, there would be r/ > and a sequence {xm} C C 
with d{xm,Xm') > V for every m^m' . Fix j such that 45„^ < t]. There exists 
{Um} C with d{xm,ym) < for every m, and by compactness of Kn^ 
this has a converging subsequence. The tail of the sequence Xm would be in 
a ball of radius 2(5^^ around the limit, which contradicts the construction of 
{xm}- Thus C is totally bounded, and hence its closure is compact. 

If Xn G Kn for every n and d^Xn^ , X) < 6nj for every j, then X £ C. We 
conclude that F{X ^ C) < 2e. □ 

2.4. Empirical distribution function. Let J be an arbitrary subset of /. 
The empirical process Gj indexed by the class of functions = {t(i^x] - x & J} 
is the restriction of Vi{Ft — F) to J, where Ft is the empirical distribution 
function, defined by 

1 /■* 

^t(^) = jy^ H,x]iXu)du. 

Kutoyants [13], Negri [18] and Kutoyants and Negri [15] studied this object 
for a certain class of stochastic differential equations. In particular, Negri 
[18] proved that for these particular models, Vi{Ft — F) converges weakly to 
a Gaussian limit, uniformly on the entire state space. We extend their results 
to general regular diffusions and obtain necessary and sufficient conditions 
in terms of the scale function and stationary distribution. 

In our general setting, it follows from the classical central limit theorem 
that the finite-dimensional distributions of Vt{Ft — F) converge weakly to 
those of a centered, Gaussian random map EI with covariance function 

EH(x)EI(y) = Am{I) j{F{u Ax) - F{u)F{x)){F{u Ay)- F{u)F{y)) ds{u). 
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For uniform weak convergence we can give a necessary and sufficient in- 
tegrability condition, analogous to the preceding result for the local time 
estimator. 

By the occupation times formula (1.4) 

Ftix) - Fix) = [ ik/t - l/m(/)) dm. 

This suggests that a version of the limit process H is given by the process 
]HI(x) = G dm for G the limit process of the diffusion local time process 
obtained in Theorem 2.1. In the proof of the following theorem it is seen 
that this integral is indeed well defined, in an L^-sense, and gives a version 
of M. 

Theorem 2.9. (i) We have Vt{Ft -F)^Min (.°°{J) for every com- 
pact J I and some x £ I if and only if Jj F^(l — F)'^ ds < co. 

(ii) We have \/t{Ft - F) ^ EI in £°°{I) for some x £ I if and only ifU 
admits a version such that M(x) almost surely as x [l or x] r. 

Proof. For any x £ I, we have 

EB.'^ix) = 4m(I) (^{1 - F{x)f ds + F'^{x) j\l - Ff ds 

Therefore, the integrability of the function F^(l — F)^ relative to s is equiv- 
alent to the existence of the covariance process of the limit process H. It is 
clearly necessary for both (i) and (ii). 

If Xn is such that s(a;„,) = , then the integrability and monotonicity 
of (1 — F)^ near r imply that X]n(l ~ F)^(x„)(s(a;„) — s{xn~i)) < oo. Be- 
cause s{xn)/s{xn-i) = 6, this implies that (1 — F)'^{xn)s{xn) 0. Again by 
monotonicity of F we obtain that (1 — F)'^{x)s{x) — > as x ^ r. Similarly 
F^(x)s(x) -^0 as x [l. 

The process G of Theorem 2.1 possesses continuous sample paths and 
hence is integrable on compacts J C I. By straightforward calculations we 
see that, for a < 6 in /, 



(2.2) 



b 

' (1{,.<„| - F(u))(l{^<„| - F{u)) ds{u) dF{x) dF{y) 



{F{h) - F{a)f (^j^ ds + J\^- F? ds 
+ (\f{1 - F{h)) - (1 - F)F{a)f ds. 

J a 
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The last integral on the right-hand side is bounded above by 2(1 — F(6))^(|s(6)| + 
C) + 2F2(a)(|s(a)| + C) for a constant C [depending on /F2(1 - Ff ds\. 
Combined with the result of the preceding paragraph and the assumed in- 
tegrability of the function — F)'^ it follows that jj^Gdm — > in as 
a — > / and r. Similarly, the same is true if both a I and h^l, whence 
the integral EI(6) = f^ G dm is well defined in the L^-sense. It can be checked 
that it gives a version of the limit process H. 

(i) It suffices to prove that there exists a version of the limit process IH 
with sample paths that are bounded and dn-uniformly continuous on the 
compact J C /. In view of the preceding we have that 

E(EI(a) - U{b)f = ;^(E|M(a) - M(6)|)^ < (^*E|G| dF^ 
< ( sup ElG('u)l) {F{b)-F{a)f 

\a<u<b J 

<{l + \s{a)\V\s{bmF{b)-F{a))\ 

by (2.1). It follows that for every given compact J C I there exists a constant 
C such that dM{x,y) < C\F{x) — F{y)\ for all x,y £ J. Since F maps J into 
the compact interval [0,1], this implies that (J, de) has finite entropy inte- 
gral and hence M admits a version with bounded and uniformly continuous 
sample paths on J. [We can also apply Corollary 1.4 to see directly that 
Vt(Fj - F) ^ M in i°°{J) if J is compact.] 

(ii) Because the sample paths of the processes Vi{Ft — F) tend to zero 
at the boundary points of /, this must be true also for the limit process H. 
Therefore, the existence of a version with this property is certainly necessary. 
We can argue the sufficiency in exactly the same manner as in the proof of 
Theorem 2.6. □ 

The following corollary gives a simple sufficient condition for the sample 
paths of H to vanish at the boundary of /, as required in (ii) of the preceding 
theorem. 

Corollary 2.10. Suppose that Jj F^{1- Ff ds < oo. If {1- F)^{x)s{x) x 
loglogs(a;) ^0 as x ] r and F'^{x)s{x)loglog\s{x)\ as x I I, then the 
convergence 

^/i{Ft -F)^M 

takes place in i°°{I), for every x € I. If the functions (1 — F)^s and F^s are 
monotone near r and I, respectively, then these conditions are necessary. 
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Proof. It suffices to sliow tfiat tfie sample patlis of tlie process M tend 
to zero at tlie boundary points of /. 

Choose tlie sequence x„ such that = e". Then s{xn)/s{xn~i) = e 

and hence, for m <n, with 6^ = (1 — F)'^{xn)s{xn), 

„^ n n n—1 

J " (1 - Ff ds<Y^{l- F)\x,)s{xu) =J2bl<Y: hi 

^'"^ k=m k=m k=m 

From (2.2) it can be seen that a multiple of the right-hand side of this 
equation is a bound on 'E{M.{xn) — BI(xm,))^. 

By the bounds given in the preceding proof, for a,6G [xn-i,Xn], 

E(H(a) - H(6))2 < (F(6) - F{a)fs{xn) =: (a, b). 

In particular, for x G we have that E(EI(x) — M.{xn))'^ ^ ^n-i- It 

also follows that 

iV(e, [x„_i,x„],e„) <n( [F{xn-i), Fixn)], \ ■ \) < 

Therefore, by Talagrand [28] , for all A > and sufficiently large n and some 
constant C, 

(2.3) P ( sup (M(x) - M(x„)) > a) < e-^^'/^"-i . 

\X„—i<X<Xn / 

If 6^1ogn 0, then the series obtained by summing the right-hand side 
over n is convergent for any A > 0. In view of the definitions of bn and 
Xn this is the case under the condition of the corollary. This implies that 
limsup„^o^ sup^,^_^<;^<2,^(EI(x) — IHI(3;„)) < almost surely, as n ^ oo. By 
a similar argument on the other tail we see that sup^^_^^^^^^ \Il{x) — 
E[{xn) \ almost surely. 

Given a sequence of independent zero-mean Gaussian random variables 
Xi,X2,... with varXj = bf, let Wn = J2i^n-^i- Because J^k^l < oo, the 
series Wn converges in and hence also almost surely, by the Ito-Nisio 
theorem. Thus the variables Wn form a well-defined Gaussian process and 
Wn almost surely as n — > oo. As noted in the preceding we have that 
E(M(x„) - H(x„))2 < EkZlbl = EiWn - Wmf for every n,m G N. This 
inequality remains true for m,n G N U {oo} if we set EI(xoo) = Woo = 0. 
Therefore, by Slepian's lemma, 

Esup|E[(xfc)| <E sup (]H(3;fc)-IH(xz))<2EsupEI(a;fe)<EsupWfc. 

fc>?i oo>k,l>n k>n k>n 

Because the sequence sup;.>„ converges to zero in probability as n — > oo, 
its sequence of medians converges to zero. In view of Borell's inequality the 
same is then true for the sequence of means. Combined with the preceding 
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display this shows that sup^>„ |]HI(xfc)| converges to zero in probabihty, and 
hence IHI(x„) almost surely. 

By combining the results of the two preceding paragraphs we see that 
^^Px>x„ ~^ almost surely. A similar argument applies to the limit of 

M at the left boundary of /. This concludes the proof of sufficiency of the 
condition for the Donsker property. 

If the function (1 — F)^s is decreasing near r, then 1 — F{xn) < e^™~"^/^(l — 
F{xm)) for n > m large enough and hence F{xn) — F{xm) > (1 — F{xm))i^ ~ 
g-i/2^_ From (2.2) it follows that, for n> m and sufficiently large m, 

E{M{Xn) - MiXm)f > {F{Xn) " F{Xra)f j^^ F^ ds > {1 - F{Xm)fs{Xm). 

Arguing as in the proof of Corollary 2.7 this yields the necessity of the right 
tail condition. The condition on the left tail can be seen to be necessary in 
the same way. □ 



Because the set of indicator functions of cells in the real line is a VC-class, 
we can deduce the assertion of the preceding corollary also from Theorem 2.2 
under the condition 

j \l\s\dm < oo. 

For distribution functions F and scale functions s with regular tail behavior 
this condition appears to be generally stronger than the condition of the 
preceding corollary. For instance, if s{x) = x and 1 — F{x) = x~^''^(log3;)~" 
for large x, then the right tail of the integral in the preceding display is finite 
if a > 1, whereas (1 — F)^(x)s(x) loglogs(a;) — > as x — > oo for any a > 0. 
More generally, we have the following relationships between the conditions, 
where we state the results for the right tails only. 



Lemma 2.11. Suppose that Jj ^/\s\ dm< oo . Then: 

(i) JjF\l-F)^ds<oo. 

(ii) (1 - Ff{x)s{x) -^0 as x]r. 

(iii) If {I- Ff{x)s{x) 10 asx'lr, then {I - F)'^ dslogs{x) ^ 0. 

(iv) ///^°°(1-F)2dsloglogs(a;)^0, then {1 - Ff{x)s {x)loglogs{x) ^ 0. 



Proof. By Markov's inequality we obtain, with Xt a stationary diffu- 
sion, for X such that s{x) > 0, 

l-Fix) = P{^/^)>^))<-l= r ^sdF 

•\J S(^X) J X 
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In particular, the function (1 — F)^/s tends to zero at the right endpoint of /, 
proving (ii). Then partial integration gives that, for xq such that s{xo) = 0, 

(2.4) / V~sdF = - / —(l-F)ds. 

We conclude that finiteness of the two integrals in the display is equivalent. 

(i) Because ^^(1 - F)^ < (1 - F)/^ we obtain that /J^ F^{l-F)^ds< 
oo. Convergence of this integral at the left endpoint of / is proved similarly. 

(iii) Define Xn by s{xn) = e". Integrability of the function (1 — F)/y/s at 
the right end of / implies that J2ni^ ~ F){xn)e"'^'^ < oo. Because the sequence 

(1 — F)(a;„)e"/^ is decreasing by assumption, it follows that (1 — F){xn)e^^'^n — > 0. 
(Indeed, if X^^n < oo and a„ is decreasing, then oo > X^fc Z]2*:-i<n<2'-- '^n ^ 
2^~^a2fe-i , so that 2'^ 02*: ^ as A; ^ 00. It follows that sup2fe-i<„<2fe fian ^ 
2'^~-^a2fe-i 0, so nan — > 0.) Hence, 

^ (1 - Ff{xn)s{xn) < (1 - F)(x„Je"o/2 ^ (1 - F)(x„)e"/2 = O(l/no), 

n>no n>no 

as no ^ 00. This implies that logs(3;„Q) (1 — F)^ — > 0. 

JlQ 

(iv) Withx^ as before, we have (1 -F)2(a;„(,)s(x„J < En>no(l~-^)^(^") ^ 
s{xn)i which is bounded above by a multiple of /^^ _^(1 — F)^ ds. □ 

On the other hand, it is not true in general that the condition \/\s\dm < 
00 is stronger than the condition of Corollary 2.10 and hence the latter con- 
dition is not necessary in general. This is also clear from the proof, which is 
based on the assumption that the right-hand side of (2.3) yields a convergent 
series. Without some regularity on the sequence b^, this does not reduce to 
the simple condition as stated. 



Example 2.12. Define a sequence Xn by log log log log log s {xn ) = n (where 
we use the logarithm at base 2), and define 

1 — F(x) = Xn-l < X < Xn- 

VS (a^n ) V log log log log s{Xn) 

Then Jj y/\s\dm < 00, but (1 - F)'^{xn) s(x.„) = (loglogloglogs(xn))~^. 

Because this distribution function F possesses flat parts, it cannot appear 
as the stationary distribution of a regular diffusion. However, by moving a 
tiny fraction of the total mass, we can construct a distribution with full 
support without destroying the preceding properties. 
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3. Continuous martingales and majorizing measures. Let (il, {JTj}, P) 
be a filtered probability space. On this stochastic basis, suppose that we 
have a collection M = {M^ : 6 S 0} of continuous local martingales = 
{Mf : t > 0), indexed by a countable pseudo-metric space (0, d). The quadratic 
d-modulus of continuity \\M\\d of the collection M is the stochastic process 
defined by 



Here (A^) denotes the quadratic variation process of the continuous local 
martingale N . 

The quadratic modulus was introduced explicitly by Nishiyama [21, 22] 
and appeared already implicitly in the papers Bae and Levental [2] and 
Nishiyama [20]. The relevance of the quadratic modulus stems from the 
fact that for every time t > and every constant X > 0, the random map 
9^ Mf IjiiAf j<ic} is sub-Gaussian with respect to the pseudo-metric Kd. 
Indeed, the Bernstein inequality for continuous local martingales (see, e.g., 
[26]) implies that 



For random maps whose increments are controlled in this manner, the theory 
of majorizing measures gives sharp bounds for the modulus of continuity. 
As before, we denote by N{rj,Q,d) the minimal number of balls of d-radius 
r] that are needed to cover 0. The symbol < between two expressions means 
that the left-hand side is less than a universal positive constant times the 
right-hand side. 

Lemma 3.1. For all 6,x,r] > 0, K >1, every Borel probability measure 
v on {Q,d), and every bounded stopping time r, 



M\\d,t 



sup 

9,ip:d{9,tp)>0 




<F{\mI> -Mf\>x,\\M\U,t<K) 

< P(|Mf - m/I > X, {Af - M'f')t < K'^d\d,i;)) 
<2e-(i/2)xV(^''i'(e,^)). 




sup sup 
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Proof. We may of course assume that the right-hand side of the in- 
equahty in the statement of the lemma is finite. Introduce the stopping time 
tk = inf{t: ||M||rf^t > K}, so that the probabihty in the statement of the 
lemma is bounded by F{supf-^^Xf > x), where 

(3.1) Xf= sup \M^^^t-Mlj. 

d{e,tl>)<5 

By Bernstein's exponential inequality for continuous martingales we have 
for all a > and every finite stopping time a 

p(|m4^,-mVI>«) 

< 2e-(V2)aV(^'d'(9,^))_ 

Hence, the random map 9 ^ -^'^ta-ao- sub-Gaussian with respect to the 
pseudo-metric Kd. By formula (11.15) on page 317 of [16] this implies that 
for all ?7 > 



(3.2) EXi<K(..^[^Xo^-^^^4. + 6-M^)), 

where Bd{S,,e) is the ball around with d-radius e. In particular, we see 
that ^Xf < oo for every t > 0. Also, for any pair {9,ijj) and for every finite 
stopping time cr, by the Davis-Gundy inequality. 

Thus, the collection {M^^^^^ — M^^^. : o" is a finite stopping time} is bounded 
in and therefore uniformly integrable. This implies that the stopped local 
martingale M^^^^ — Mf^^^ is of class (D), which means that it is in fact a 
uniformly integrable martingale (see, e.g., pages 11-12 of [10]). It is then easy 
to see that the process defined by (3.1) is a submartingale. Hence, by the 
submartingale inequality, P{sup^^^ Xf > x) < FjX^/x. In combination with 
(3.2) this yields the statement of the lemma. □ 

With the help of this lemma we can prove results concerning the regular- 
ity and asymptotic tightness of collections of continuous local martingales 
under majorizing measure conditions. The key condition is the existence 
of a pseudo-metric d on for which the modulus is finite or bounded in 
probability and for which there exists a probability measure v such that the 
integral on the right-hand side in the preceding lemma is continuous at zero. 
The latter is the continuous majorizing measure condition: 



(3.3) limsup / Jlog ]^ — —de = 0. 
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The first theorem deals with regularity of a given collection of local martin- 
gales M. 

Theorem 3.2. Suppose there exists a Borel probability measure v on 
{Q,d) such that (3.3) holds for a pseudo-metric d on Q for which ||Af H^^,- < 
oo almost surely. Then the random map 6 i— > is almost surely bounded 
and uniformly d-continuous on Q. 

Proof. By Lemma 3.1, there exists for every n € N a positive number 
5n such that for every K,x>0, 

pf sup \M^-Mf\>x;\\M\U,r<K)<^. 

For every n, define the event 

An = \ sup \M'^-Mf\>l^]. 

Then for every i^T > we have EP(^n; \\M\\d,r < K) < KY.'^-'' < oo. So 
by the Borel-Cantelli lemma, P(A„infinitely often; ||M||rf < K) = 0. Since 
||M||rf^T- is almost surely finite by assumption, it follows that 

P{An infinitely often) = P(A„ infinitely often; ||M||rf < oo) 

<^P{An infinitely often; \\M\\d,r<K) = 0. 

K 

So we almost surely have that supd{e, ip)<5„ — Mf\ < 2""- for all n large 
enough, which implies that the random map 9 i— > is uniformly continu- 
ous. Recall that under the majorizing measure condition, the pseudo-metric 
space (0, d) is totally bounded (see, e.g., the proof of Lemma A. 2. 19 of [29]). 
It follows that 9 ^ Af^ is bounded with probability 1. □ 

Suppose now that for each n G N, we have a collection M" = {M"'^ : 9 G 
Q} of continuous local martingales and a finite stopping time Tn on a stochas- 
tic basis (r2'^,jF",{jr"},P"). For each n the local martingales M"''^ are in- 
dexed by a parameter 9 belonging to a fixed pseudo- metric space B. Recall 
that a sequence X„ of £°°(0)-valued random elements is called asymptot- 
ically d-equicontinuous in probability if for all £,r]> there exists a 5 > 
such that 

limsupPf sup \Xn{9) - Xn{ip)\> e) <r]. 

Weak convergence in £°°(0) to a tight limit is equivalent to finite-dimensional 
convergence and equicontinuity with respect to a semimetric d such that 
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(0,(i) is totally bounded (see, e.g., [29], Theorem 1.5.7). For the random 
maps 6<—>-M^^^, finite-dimensional weak convergence will typically follow 
from a classical martingale central limit theorem (cf. [10]). Using Lemma 3.1, 
it is straightforward to give sufficient conditions for asymptotic equiconti- 
nuity in terms of the quadratic modulus and majorizing measures. The next 
theorem extends Theorem 3.2.4 of [22], which gives sufficient conditions for 
asymptotic equicontinuity in terms of metric entropy. 

Theorem 3.3. Suppose there exists a Borel probability measure v on 
(6, d) such that (3.3) holds for a pseudo-metric d onQ for which HM^H^.r^ = 
Op{l). Then {Q,d) is totally bounded and the sequence of random maps 
9<-^ M^^^ in £°°{@) is asymptotically d-equicontinuous in probability. 

Proof. The total boundedness of {Q,d) is a direct consequence of the 
existence of a majorizing measure. See, for example, the proof of Lemma 
A.2.19 of [29]. 

Let the random map Xn on @ be defined by Xn{0) = M^'^. Then for 
every K > 

p( sup \Xn{e)-Xnm>e] 
\d{e,ip)<5 J 

<pf sup |X„,(^)-X„(V^)|>e;||M"||d,,„<K')+P(||M"Hrf,,„>i^). 

Vd(6l,?/))<5 / 

Now if ?7 > is given, we can first choose K large enough to ensure that 
limsupP(||M"||d.T-„ > -f^) < '^/2. Lemma 3.1 implies that for this fixed 
we can choose a 5 > such that the first term on the right-hand side is less 
than 7y/2. □ 

The preceding theorems do not use the full power of Lemma 3.1, because 
they use the control in Q of the local martingales t ^ M"'^, but not the 
control in the time parameter t. In the following theorem we use the lemma 
to establish a majorizing measure condition for the asymptotic tightness in 
^°°([0,r] X 9) of random maps of the form (t,Q) ^ M"'^ for fixed T € (0,cx)). 

We make the same assumptions as in the preceding theorem, and in addi- 
tion assume that for every fixed 6^0 the sequence of processes (M^ ' : < 
t <T) is asymptotically equicontinuous in probability relative to the Eu- 
clidean metric on [0,T]. By the martingale central limit theorem, this is 
certainly true if the sequence of quadratic variation processes (M"'^) con- 
verges pointwise in probability to a continuous function (which is then the 
quadratic variation process of the Gaussian limit process). 
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Theorem 3.4. Suppose there exists a Borel probability measure v on 
(G, d) such that (3.3) holds for a pseudo-metric d onQ for which ||M"||d.T-„ = 
Op(l). Furthermore, assume that, for every fixed 6 € Q, the sequence of 
processes {Mj^'^ : < t < T) is asymptotically equicontinuous in probability 
relative to the Euclidean metric. Then the sequence of random maps M" is 
asymptotically tight in the space £°°{[0,T] x Q). 

Proof. By the majorizing measure condition (3.3) the set is totally 
bomided under d. If 6i,...,6m is a 6-net over and s,t£ [0,T], then for 
all i 

- M^'^l < |M,"'^^ - Mj^'^'l + 2 sup |Mf - Mj"'^'|. 

0<t<T 

Hence 

sup sup \M^'^ - Mj^'^^l 
\s-t\<jd{e,ip)<s 

< sup sup (|M;'^-Mi"'ViMr'^-Mr''^i) 

\s-t\<'yd{e,ilj)<5 

<max sup |M,"'^' -M"'^"| +3 sup sup |M"'^ - M"''^]. 

* |s-t|<7 0<t<T d{e,i')<5 

Fix e,rj > 0. Extending the argument in the proof of Theorem 3.3, we can 
show that there exists 6 > such that 

(3.4) limsuppf sup sup |M"'^ - M"''^| > < r/. 

n^oo \0<t<T d{e,tlj)<S / 

For this 6 = 6{e,ri) there exists a finite 6-net 9i,...,6m over (where m 
depends on 6). By the assumption of asymptotic equicontinuity of the pro- 
cesses 1 1— > M"'^, there exists 7 = 7(77, m,9i, . . . , 9m) such that 

limsuppf sup |M,"'^' -M"'^"| >e) < ^, i = l,...,m. 
Combining the preceding displays we see that 




Thus for the given pair (e, if) we have found a pair (7, 6) of positive numbers 
such that this holds. Because the probability on the left-hand side is increas- 
ing in 7 and 5, the bound remains true if we replace 7 or (5 by the smaller of 
the two. This implies that the sequence of processes M" is asymptotically 
equicontinuous in probability relative to the product of the Euclidean metric 
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on [0,T] and the pseudo- metric d on 0, and hence it is asymptoticahy tight 
([29], Theorem 1.5.7). 

□ 

In the preceding theorem we can also use an arbitrary pseudo-metric for 
which the interval [0, T] is totally bounded (and this could be permitted to 
depend on 9), rather than the Euclidean metric. However, because the local 
martingales 1 1-^ ' are continuous relative to the Euclidean metric by 
assumption, this apparent generalization would not make the theorem more 
general: the necessary continuity of the limit points 1 1— > would imply that 
the equicontinuity necessarily also holds relative to the Euclidean pseudo- 
metric. For simplicity of the statement we have used the Euclidean metric 
throughout. 

4. A limit theorem for diffusion local time. In this section we collect 
some classical and some less well-known facts about diffusion local time. We 
shall need these in the proof of Theorem 1.2. As in the Introduction, let 
X be the regular diffusion on the open interval /. A central result in the 
theory of one-dimensional diffusions is that diffusions in natural scale are in 
fact time-changed Brownian motions; see, for instance, [25] or [11]. In our 
setting, we have that under Px, it holds that s{Xt) = WVt, where is a 
Brownian motion that starts in s{x), and Tt is the right-continuous inverse 
of the process A defined by 

A = J^L'^{siy))midy). 

Here = {Lf {y) : t > 0, y € M) is the local time of W. It follows from this 
relation that the local time lt{y) of X with respect to the speed measure m 
satisfies lt{y) = Lf^{s{y)). 

This time-change representation of diffusion local time shows that with 
probability 1, the random function y^lt{y) can be chosen continuous and 
has compact support. In particular, it holds that ||/f ||oo = snY>y^ilt{y) < oo 
almost surely. In [30] it is shown that in fact, ||/f ||cxd = Op{t) as t — > oo. 
For the sake of easy reference, we include a proof of this fact. We need the 
following lemma. 

Lemma 4.1. For every x £ I we have, for Z standard normally dis- 
tributed and t — > oo, 

t2 ^ m2(/)Z2' 
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Proof. The process A defined above is a continuous additive functional 
of W, and since m is finite, it is integrable. By Proposition (2.2) in Chapter 
XIII of [24], it follows that 

(4.1) i=^,l^m(/)Lf(0), 

where is the local time of a Brownian motion B that starts in 0. The 
process r is the right-continuous inverse of A, so for every t,T >0 it holds 
that < T if and only if At > t. By (4.1), it follows that, for every z>0, 

P.(^<.)=P.(^..>t) = P.(^^..>^) 

^P,(^m(/)Lf(0)>-^ 

= P ( I <, 

%m2(I)(Lf(0))2 

To complete the proof we use the well-known fact that (Lj^(O))^ has a Xi- 
distribution (see [12], Theorem 3.6.17 and Problem 2.8.2). □ 

Theorem 4.2. For every x £ I we have \\lt\\oo = Op^{t) ast^oo. 

Proof. Let us write at = t~^\\lt\\oo- We have to prove that at is asymp- 
totically tight for i — > oo. By the time-change relation, we have for all a,b> 



Fxioit>a) = Fx( sup jL^^rt/t2){z)>a] 



<fJ sup lL,\(z)>a)+PJ-|>6 

By the scaling property of Brownian local time (see Exercise (2.11) in Chap- 
ter VI of [24] and note that is a Brownian motion starting at s{x)) it 
holds under P^, that 

sup h^^iz)^ sup L„^(^^)=supLf(z), 

zeR,u<b t z€R,u<b \ t / zGR 

where is the local time of a standard Brownian motion B (starting in 
0). So we find that for all a,b> 



(4.2) P,(at >a)<P,(^supLf(z) >aj +F,i^^>b 

The proof is finished upon noting that z L^^iz) is bounded (because con- 
tinuous with compact support), almost surely and Tt/t'^ is asymptotically 
tight. □ 
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5. Proof of Theorem 1.2. 

5.1. Reduction to the natural scale case. Let us first show that it suffices 
to prove the theorem for diffusions X that are in natural scale (i.e., for 
which the identity function is a scale function). The diffusion Y = s{X) is in 
natural scale (see, e.g., Theorem V.46.12 of [25]), and we have the relations 

m = m^os, lt = lJos, F = F^os, 

between the local time Z^, speed measure mX and stationary distribution 
of Y, and the local time I, speed measure m and stationary distribution 
F of X . Moreover, 

EM^{X o s~^)EI^(i/ o s"^) = EMAMi/. 

It follows that the class A is Donsker for X if and only if the class A o = 
{A o s"^ : A G A} is Donsker for Y. So if we have proved the theorem for 
diffusions in natural scale, we can apply it to the diffusion Y = s{X) and 
the class A o s~^ to prove it for a diffusion X that is not in natural scale. 

In the remainder of the proof we therefore assume that X is in natural 
scale. The process X is then an ergodic diffusion in natural scale on the 
open interval /. Therefore, we must have I = M (see, e.g.. Theorem 20.15 
of [11]). Moreover, the fact that the state space is open implies that X is 
a local martingale (cf., e.g., [25], Corollary V.46.15). We also note that for 
diffusions in natural scale on an open interval, the diffusion local time lt{x) 
with respect to the speed measure coincides with the semimartingale local 
time of X (see [25], Section V.49). 

5.2. Asymptotic equivalence with uniform weak convergence of continuous 
local martingales. In this section we show that the weak convergence of the 
empirical process Mt is equivalent to the weak convergence of a normalized 
£°°(A)-valued continuous local martingale. Since X is now in natural scale, 
we have / = M. For every x E M, define the functions tTx and II^; on M by 
Tlx = 2(]l[^.^oo) - F) and 

ry 

^xiy) = / 7Txiu)du, 
Jyo 

where yo is an arbitrary, but fixed point in M. The function tTx is the dif- 
ference of two increasing functions, and hence lix is the difference of two 
convex functions. Moreover, we have the relation Hxib) — T^xio) = v{a,h] for 
all a<b, where i' is the signed measure i' = 2(5x — fj) on M, and 5x denotes 
the Dirac measure concentrated at x. So by the generalized Ito formula (see, 
e.g., [24], Theorem VI.1.5, or [25], 45.1) 

Iix{Xt) - Iix{Xo) = f\x{Xu) dXu + \ f itivHdy)- 
Jo Jr 
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It follows from the definition of v and the occupation times formula (1.4) 
that 

1 /■ /■ 1 

k{y)v{dy) = lt{x) - / lt{y)i^{dy) = lt{x) — -t, 



2 Jr 7r m{I) 

so that, under P^, 

7^*^^) - -im = 7(nx(Xi) - n,(z)) -\f ^,{Xu) dXu. 

t m(K] t t Jo 



If we integrate this identity with respect to X{dx) and use the stochastic 
Pubini theorem (see [23], Theorem IV. 45), we see that the empirical process 
Mt can be decomposed as 

(5.1) MtA = i2,,t(A) - ^Mi^ 

under P^, where M'^ is the continuous local martingale defined by 

(5.2) = 2 f\x{Xu)dXu with /ia(x) =A(/,x] -F(x)A(/), 

Jo 

and Rz^tW = t~^^'^ J^CnxiXt) — Ilx{z))X{dx). The next step is to show that 
the -R^^t-term vanishes uniformly in A, so that we only have to deal with the 
martingale part of Hj. The functions ir^ are bounded in absolute value by 
2, so we have the pointwise inequality |na;| < 11 for every x, where 11 is a 
function that does not depend on x. It follows that 

sup4=|n..(xo -n,.(z)| < 4=(n(Xt) + n(z)). 

Consequently, we have 

sup |i?,,t(A)| < sup ||A||^(n(Xt) + U{z)). 
AeA AeA Vt 

The right-hand side converges to in probability, since the law of Xt con- 
verges in total variation distance to the stationary measure /i as t — > oo, 
whatever the initial law (see, e.g., [25], Section 54.5). 

5.3. Continuity of the empirical process. In this section we prove that the 
empirical process is (in-continuous. Observe that we have MtX = Jj 4>t{x)X{dx), 
with 

cl>t{x) = Vt(ht{x)-^ 

Since the random function x lt{x) is almost surely continuous and has 
compact support, the random function (pt is bounded and continuous with 
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probability 1. Note also that by the occupation times formula (1.4), (j)t sat- 
isfies 

/ (f>t{x)m{dx) = [ ldu — Vt = 0. 
J I vtJo 

Hence, the de-continuity of the empirical process is a consequence of the 
following lemma. 

Lemma 5.1. Let ^ be a continuous function on I with compact support 
and (j) = 'ip — J ipdfi. Then if Xn and X are signed measures with \\Xn\\ bounded 
and duiXn, X) 0, it holds that 

(t){x)Xn{dx) / (l){x)X{dx) 



as n —> oo . 

Proof. Define a„ = Xn{I) — X{I). Since the total variation of the signed 
measures A„ is uniformly bounded by assumption, the sequence q„ is bounded. 
Hence, it has a converging subsequence, say a„' — > a. Observe that the con- 
vergence dM{Xn,X) implies that n' has a further subsequence n" such 
that 

Xn"{l,x] - Fix)Xn"{I) ^ X{l,x] - F{x)XiI) 

for almost every x £ I [recall that X is now in natural scale, so that s{x) = x]. 
So there exists a dense set D I such that for all x G D 

Xn"{l,x] = Xn"il,x] - F{x)Xn"{I) + F{x)Xn"iI) 

X{1, x] - F{x)X{I) + F{x){a + A(/)) = X{1, x] + aF{x). 

Since ^p is compactly supported and continuous, we can approximate it 
uniformly by functions ipm of the form tpm = Ei^i l(a„,,,a„,,+i]V'(«i,m) for 
o-mfl < am,i < • • • < am,m finite partitions of the support of ip. The preceding 
display shows that for every fixed m, as n — > cx), 

^pm{x)Xn"{dx) ^ J ipm{x)X{dx) + a J V'm(x)/i((ix). 

By the uniform approximation as m — > oo, this is then also true for ip in the 
place of ipm.. Consequently, 



(t){x)Xn"{dx) = I ^{x)Xn"{dx) — Xn"{I) J 1p d/I 

ip{x)X{dx)+a / ilj{x)fi{dx) — [a + X{I)) I il){x)fj,{dx) 



(j){x)X{dx). 
This completes the proof. □ 
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5.4. End of the proof . We can now finish the proof of Theorem 1.2. That 
the existence of a bounded and de-uniformly continuous version of the Gaus- 
sian hmit EI is necessary for A to be Donsker follows from the general theory 
of weak convergence to Gaussian processes. 

For the proof of the converse, suppose that such a bounded and uniformly 
continuous version EI exists. By Theorem 12.9 of [16], this implies that there 
exists a Borel measure u on [A, da) such that 



(5.3) limsup / \j^og—— — — — :—de = 0. 



riio X Jo ]l u{Bdi_^{X,e)) 

It follows that (A,de) is totally bounded (see, e.g., the proof of Lemma 
A. 2. 19 of [29]), and therefore separable. Hence, since the empirical process 
Ht is dn-continuous, we may assume that A is countable. 

By the considerations in Section 5.2, it suffices to show that the random 
maps EIj defined by EIjA = t~^^'^M^ converge weakly to H, where is the 
local martingale given by (5.2). By the ergodic theorem ([9], Section 6.8), 
we have 

^(M\M^)t = ^ f' hx{Xu)K{Xu)d{X)^ = ^ f hx{x)K{x)k{x)dx 
t t Jo t 

^' -4^ I hx{x)hJx) dx = EMAMz/ 
m(M) Jm. 

as t — > oo. So by the martingale central limit theorem, the finite-dimensional 
distributions of EIj converge weakly to those of EI. Now pick an arbitrary 
sequence a„ — > oo and apply Theorem 3.3 to the local martingales M"^'^ 
defined by 

and the stopping time r„ equal to 1. Then in view of (5.3), all that remains 
to be shown is that ||M"||rfj^^i = Op(l). Since 

-(M^-M-),„ = - / {hx{x)-K{x))\^{x)dx<-\\laA^dl{\v), 

a-n CLn JR an 



we have 

11^ L„.l= sup TTTT—^ ^—¥a. 



nu2 _ {l/an){M'-Mna„ ^ 1 

de(A,!^)>0 %l'^'^) 



Or, 



By Theorem 4.2 it holds that ||Za„||oo = Op{an), so indeed, ||M"||djji 
Op(l). 
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